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attempt is being niad€ to improve subject accass to 
lonoqraphs by aagmenting HMC (flachir.e leadable Cataloging) records, 
fiforking'with a saaple of books drawn from the cQlleetions of the 
DniYersity of TorontQ and comprising a number of subject arias in the 
humanities and social BCiences^ the project plans to enlarge the MARC 
description by using a jset of decision rales for selecting words and 
phrases found in the index and/or the table of contents^ This first 
quarterly report for the period Jtine to August 1976 describes the 
sample of selected moiiographs^ summarizes the project budget to date^ 
and projects activities for the future. CSMH) 



^ Docuients acguirtd by lEIC include many informal unpublished * 

* materials not available fron other so^rcts* IRIC aiakes erery effort * 

* to obtain the best copy a?ailable, Nevertheless, Itess of marginal ^ 

* reproducibility are often encountered and this a£f€ctB the quality * 

* of the licrofiche and hardcopy reprodnct ions ERIC niakes available ^ 

* ?ia the IRIC Document Eeproduction Service (EDRS) • EDRS is not ^ ♦ 

* responsihlt for the quality of the original doctinent, leproductions * 

* supplied by IDiS are the best that can be made froa the original. . * 
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Paulina Atherton - Project Diractor 
3.K,L* Qmrovp^ - Assistaiit Project Director 
Bette Brindie - Research AssDci£te 
Barbara Set eel ^ Research A?Hslstant 
Charyl Willson - Siacrcitary 

and niecibars of the univaralty of Toronto^ Hobarta Library staff who h*c*.lp 
on site during the satapllrig and photocopying steps. 



Time Table for Key Events in Project - First Quarcer 

1. Sariple selection at Univrsrsicy of Toronto co repreaent se^^aral subject 
categorias In deptn.^ Photocopy a ample monographs* table of contents 
aiid inde^^es* 

2* Dascriptive reporC on sanvpLe cotlfc^'Ctian, For eKample: 
Numbar of titles per subject categorias selected 
Nuniber of useful GOntent:s pages 

Number of subjact indexes found (pages of iiideK/book) etc. 



^ Due to funding delay ^ the first Quarterly Progress Report covers the 
period from June 15 through August 31, 1976. 



U 1 Dl^AeTMl,>tT OF NCALTH. 

THIS DOCU^^fNT MAS mU^m PF^^O. 

THE PCSION OR ORGftNllATtQN 0^lC»1N^ 
ATIWS IT PO! NTSdF ViiW OH OPINiOI^I 
STATIC PO MOT NeCillA8JL¥ 
Sf^TDPFiClAL NATIONAL iNSTiTyf e OF 
EOUCAT'ON POSITION 0» POLICY 



SiiMILE OP U^NIVERSITY OF T.0P,0'NTO LIBRAHY mCHINE-READABLE CATALOG 

(Acqiilslttons siiice 1966) 



The emphasis: -of our project is on experimentation with improved subject acc^Bs to 
ba^ks in the tamaoltieft atid social scienctis. As a part of this effort ^ our goal in sam- 
pai-ng post-l§66 acquisitions at the Unlv*srsity of Torotito Is the ability to outline re- 
liably the Indexing profile of monographs representing fields of knowledge we would wrk 
with, did n^ot aim at a sample rtpresentative of the Library's eTitire holdings in the 

social selamces and hoMnltieS'. Rather, we opted for in-daprh study through a repre- 
santativ'e sample of eight disciplines divided equally amorig the above two broad rireas. 
Eligible iCemfi for study In our project were riDnographs in English only. We used the 
UTLAS ConvGTsion, Project listings the s:3rrpling frame, which ^as Initially sorted to 



lansua^Q nicitar 



in, Is nnd ,sei 



as ended by the Uniwrsity of Toronto 



Llbxary^ thra drc'w a systeTriiatic ran-don sample of 1592 iCems^ as seen in Table 1* 

We belleve= this selection presents a f^cod balance between the areas cf knowledge con- 
cernltit^ uss with an cy'-e on potential user demand in a typical acadcinic setting. 

In addition, w decldad to study t.TO selGcticns in their entirety; i.e., eKhnustive 
lietin'^s of the Universit? of Toronto, Eobarts LllDrary holdings in Post-Conf ederatlcn 



Ontario History (300 item^) and U 



:d an 



Plannln^ (5 23 items)* These t^o selections will 



allow thorough searching for reml requesr^s po^ed to the Robarts Library reference staff. 

Taken together ^ we will work with auginented records of 2415 monographs. The sarnple 
and the two selections vtll be k^-pt separate for purposes of analysis* 



Siniple Compos! tionj n ^ 
Humaraitles 



mBiE_i^ 

L592^* and Complete S^lMtlDrns (n ^ 823). (Grand Totals ^ 2415 itenis) 
# of Items Soeial Sciences # of items 



1, Phtlosap'hyj 152 

BC logic, 

BB Aesthetics 

BJ Ethica 

2, History^ 1^9 
OE Craeeo-toman ^orld 

DF Ancient €raece 
DG Ancient Italy 
Rome to 4? 6 

3, f^tmt 201 
NH Seyipture 

mi Art Applied to Industry 
DeeoMtion & Crnament 

4, ilttraturei 320 
m 15 60-3300 Dram 

fOTAL Hu^anitlea 812 



S_o_eiaI ScienCfes 

5. Psychology: 
BF 1-990 

6 . Anthropology * 

1^696 

7. Public Fimnce: 
HJ 

Sociology ^ 
m U221 

TOTAL Social Sctencies 



346 



144 



136 



154 



780 



CQ!: nplete Selection for Tyo Fields 

1, Urbim Plaiininl.., 

tJrbam Rede%relopment I 523 
HT 166-177 

2, pvost-^Canfederatton 

Ontario Hiatorys 3TO 

F 552©-SS47 



* thlB im updated vftralon of table pre^enJted in Atig,t:iBt "Occasional NwsletterJ' 
^* Total estimated population of Ingllah langmge^ nGnL-serlal Items N^16,910^ mMi 8,601 
^^^r 51%) In the Muwanltl(BS» and 8,310 Co^ ^^%) the Soc^ial Scilinc€ dlsclplliMs, 



Our first ♦^Occasional Ne^:sletter" (kxi^uBt 1976) presented ths structure of our 
sample drawn using rtie UTLAS ConversiDn project listiT^gs, Ir; preparation for the 
preltminary analysis^ Siace then we have, contlni'ici photocDpying indexes, tables 
of contents^ illustration ; Ists, mnpa^ etc. fron asch. monograph ^ in order to 
prepare results as follo'wsi 

1) a.ctu.al nurnbeT ^of t oriop,rapris selected 

2) number of i^O'aograpbs ^xdth mu. index 

3) ntitriber of rionofgrriphs without B-n itLdsK-, 
table of cOTitipnua. or lists 



4) number of monogrnphs that coiitain a 
table of coxiteiits bL4t no ir.deK 

5) mmhBT of monographs that have no 
table of contents or iudax but have 
some kind of list of illustrations j 
mapBs or tables 

6) Attrition rate - mmber of missing 

itcns snd dciotes 

7) Profiles of tha cDti tents and 
inde%. matarial for each class of 
monograpns 

Our progress repert at this time (Septomb'er 1976) oiutllmet the resu'lts we 
liave compiled so fat* One hundred t\?e'nty--f l^fe i tarns have althac' m€. beetii located! 
or fully processed as yet*. Incoming data regard lug their IndBxesi and tables mi con-^ 
tSEits. would change the profiles genorated as of noy!^ Some of these 125 Items-* r^ay 
fall into the sixth catefory^ attritlori. 7o datej 61 iteM hm^e bean deleted frotn 
the original sample- as dupliriateisu, naii-Bnglish books> or joyrnal articles errdnaously 
listad as TOnO:graph.s foe the UTLAS Convar alAoni Proj act . Refer to Table 1, the rocord 
of our sample. 



table Two' below reports the current status €f mtiograplh aTialysls as £ollo^-s: 
Colunin 1 ^ numbar of ttettis located process'ed and i^eady lor the nBKt phase* 
Columfi i - indiica^es the remaining; IteTHis that most be located and processed^ If 
not f oun dfd&l&t led * 

Coiuirtn 3 number of processed items in Colwmn 1 tlia.t have, usable table of contGnts 
Coiutnn 4 - percent of the processed that contain tabl^a of contemts 
Column 5 - number ot processed (Column 1) Items that have am index 
Coflumn 6 - pei'cent of proeessjedl (Column U Items that have an In^^^ex 
Colum\n 7 - the range o£ the number of pageH per Index 

Included in the appendix are histograifis of ^oeGuriences of number of pages 
pQT index. The mode for all disciplines is belo^^^ 10 pages of indm^^ a fresult ^blch 
makei our work imnt quarter) very manageable. 
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TABLE 2 



Sample 



# of Hot 
Items Processed 

(1) (2) 



Ta'ble of 
Contents 

Hi) (4) 



, , Philosophy 143 

BC Logic 

BH Aesthetics 

BJ Ethics 

!. History 126 
DE Graeco-Ronan World 
DF Ancient Greece 
DG Ancletiit Italy 
Rome to 475 

1. Arcs 185 
NB Sculpture 
NE Engraving 

NK Art Applied to Industry 
DecoratioTi A Ornament 

„ Literature 296 
PN 1560-3300 

Psychology 318 
BP 1-990 

. Anthropology 138 
GN 1-696 

. Public Finance 127 
HJ 

. Sociology 148 
HM 1-221 

Sample Totals 1481 



134 



94X 



13 



120 



95 



16 



,33 



72 



24 
28 
6 



111 



255 



309 



130 



107 



141 



1329 



86 



97 



94 



84 



95 



90% 



Index ■ 

#(5) (6) 



80 



56% 



99 



79 



89 



154 
250 
93 
70 
102 
937 



52 



79 



67 



55 



69 



63% 



Range of f' of 
Index Pages 



1-74 



1-141 



2-94 



1-35 



1-117 



1-42 



1-76 



1-27 



1-141 



XXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXX 



Comp 1 e t e Co 1 le c t Ion 
Llstlna^s 

, Urban Plannlnf,. 
Uroan Redevelopment 
Hf 166-177 

. Post Confederation 
Ontario Hlatory 
F 5520-5547 

Coniplete Collec tion 

Total 



Grand Total 



516 



293 



809 



14 



467 



190 



657 



91 



65 



81% 



146 



88 



234 



28 



30 



29% 



1-43 



1-89 



1-89 



XXXXXXXXXXXXXXXXXXJCXXXXXXXXXXSXX 

2290 125 1986 871 1171 51% 1-141 



* See Hlstogtanis In Appendix 
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ANALYSIS OF THE CIL4MCTERISTICS OF INDEXES AND CONTENTS PAGES IN BOOKS 

During the past two^three week& we have been engaged in the process 
of observing and analyzing the cQmpositlon of indexes and tables of con-- 
tents from the monographs in all fields sampled. An overview of the 
variety of syntactical and physical relationships is In the draft stage 
nov* At this point, we are not willing to make any statement of gener- 
alities describing or defining indexes and their elements. These are 
non-standardized j free-^text, indexes and contents lists. 

The Indexes and tables of contents are being approached in several 
ways. Coniparisons may be made: 

(a) within one subject fieldj 

(b) by type of index (subject or subject St name) 
without regard to discipline^ or 

(c) between the table of contents and the index 
entries of one book. 

In quantitative terms, we are looking at the percent of "names'" versus 
"subject'' entries 3 the extent to which page ranges after an index entry 
occur, the possible relationship between subdivided indeK entries and the 
length and Dccurrence of page ranges, the approximate number of entries 
par chapter, per page, or for an entire book. 

Qualitative assessments are being focused on semantic relationships 
between words and phrases- In this respect, most notable are the number 
of entries which can only be properly understood within the context of the 
entire index or boo/k iltself • Words like the, the_nj which are automatically 
assigned to stop word lists for computerized subject searching cannot be 
easily discarded in searching index entries in books on logic where entries 
like "if . • • . thens*- -^lawj the moral" appear. 

We will continue these studies and begin comparisons across gubject 
fields^ all in preparation for our choice of selection rules. 
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INFOR>mL FINANCIAL REPORT 

Expended 

SALARIES Budset Total as of 8/31/76 

Project Director 14,453 6,527.00 

Assoc, PrDject Director h,950 5,206,00 
(Univarslty salary increase as of 7/1/75) 

Grad Res. Assts. 50% AY 4,940 

100% Sunnner 4,Dr30 2,925.75 

Secretary 6,500 926,29 

Keypiinchars (Hourly) 4,500 

Survey Assts. (Hourly) 4^000 453.75 
Prograramar (Hourly) ' LOO 



OTHER DIRECT COSTS 



Subtotal 43,508 16,038,79 

FRINGE BEJ^EFITS @ 23.5% 10,225 3,769.12 



Subtotal 53,733 19,807.91 



Travel U995 1,285.63 

Duplicating 3,000 909,15 

Supplies (Office) 1,500 113.37 
Equipment Rental 

EDP U850 

Office 2,542 290,00 
SERVICES 

EDP 1,870 121.48 

SDC Sub-CDntract 7,955 

Procesaliig 2,170 20.00 



Total 76,615 22,547.54 

This rapraaants our best estimate of expenditures to date. It la different from the 
official report from Syracuse Unlvarslty because of the noiroal delays In accounting 
orocedar^s in such a large institution. 

ERIC ' 



-7- 



Expected Eveats Some^what Delayed 

1, Arrival of machlae-readatie tape of iCems in sample fioin UTLAS was 
September 15, 

2, Computer terminal. Wheri It arrives^ will perform some searches 
in LIBCON to assess stat€^o£-art for subject access to tooks on 
present on-line recrieval services* 

3, Concurrently s we hope to develop separate access, on-ltiie, to a 
computer tape froni BRODMT t^hich has enriched su'bject headings 
for 800 books from DC 300-^339 sections of the Public library 
Catalog. 



Priority Tasks in NeKt Qnarter 

1, Reports on characteristics of subjcet indeKes and contents pages 
followed by development of definitive selection rules, 

2, Discussion with library users and reference librarians on optlmuia 
access and output format options for on-line searching. 

3, Preliminary cost analysis for data input. 

4, Preliminary preparation of machine-readable data for SDC/OHBIT 
sys tern. 
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£LR TUNDS EXPEHIMENr AT SYRACUSE UNIVERSirY 
TO IMPROVE SUBJECT ACCESS TO BOOKS 



Iha Council on Library Resourcea has 
£V^rded $U j615 to the Syracuae Unl- 
-vixsity Schcol Itif orma tloii Studies 
for an iKpeTltttint that may result In. 
littfrove-d sutjecc access to monographs 
ty augmentation of MARC (Wachlne Read-- 
able Cataloging? records, 

^o*klng w^itli m aamTle of tooka drawn 
frcm ti^e coUecciDiis of the University 
of Toronto and coraTriaing a numbar of 
8utj#et categories i« the humanities 
and social scietice^, tlie project plan 
Is to enLarge ttia Subject description 
eomtalf%ed Iti ch^ HAtC record of each 
lock by utilising ^ ^et of selection 
Tulie for clioaitng words and phrases 
fo^nd w^ithln ch^ t:«daK and/or table 
of ccsntints I Ttii file of descriptions 
for Ch^ boo'kB In tJe sampla vlll be 
proc^s^ad by ch^ Sustain Development 
CoTporatton ^s ORBIT Search Service* 
On^lln© comT^car^b^sfid subject 
se^rchas ean t:han made by both 
prcj^et: staff atid -otheTS vho have ac- 
cess to the Serv^ic^, The reaults of 
th^ axperiaenc ^ill te analyzed and 
ivalijatid to dicer*ime the feasibility 
and utility o£ pirfoTmlng on-line com-- 
pute* subject lear ches for monpgraphs 
i?l th fltich ei^ricliid records. 

Vhdle? the card cat4log works well when 
an author ot clcle 1^ T4no%?n* It Is 
XmmB iffflclenc for r^tTleiflng mater- 
ials hy pubjict^ since only a few 
Ircad Cirms can be u^ed for each Itea. 
Se^e^ai cur re^t cofQptit er-bas ed ab- 
itxaeting attd ifideatltig services have 



provldad far more detailed subject 
access to journal articles (e*g* Fs 
choioglc al Abs trac te t Index Hed icus 



Engineer iTig IndeK, etc,) than the 
ords do for books, Little 

done to provide better re- 
capability for th€ user of 
hs* By enriching existing 



MARC rac 
tias been 
tr leva 1 
monogr ap 
records 
searches 
be niada 
the subj 



for monographs, more specific 

in the '^free text" mode can 
by aearching every word In 
act description. 



According to Pauline Atherton, pro- 
fessor In the School of Information 
Studies and director of the project, 
if the results are successful, "book 
Indexers and publlehers may wish to 
take greater care in preparing in- 
dexes, knowing the use of their ef- 
forts for on-line computer-based 
searching , 



PROJECT S7k?l 

Pauline Atherton Project Director 
B.K.L* Ganova - Assistant Project 

Dlrec tor 

Bette Brlndle - Research Assoclatt 
Cheryl Willson - Secretary 

SUBJIGT ACCESS PROJECT 
School of Information Studies 
Syracuse University 
113 Euclid Avenue 
Syracuse* l^mw York 13210 
(315) 423-2001 
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SAMPLE OF UNIVERSITY OF TORONTO LIBKARY MACHI NE- READABL E CATALOG 

(Acquisitioni since 1966) 



The cfflphails of out project is on ex^- 
per itnen t a t ion with improved subject 
access to books In the humanities and 
social aciences. As a part of this 
effort, our goal In sampling post-1966 
acquisitions at the University of To- 
ronto is the ability to outline reli- 
ably Che indexing profile of inono-= 
graphs rapresanting fields of know-- 
ledge we would work %^ith# We did not 
aim at a sample representative of the 
Library's entire holdings in the so- 
cial sciences and. human ities . Rather^ 
we opted for in-depth study through a 
representative sample of eight dlsci'^ 
plines divided equally among the above 
two broad areas. Eligible items for 
study in our project were monographs 
in English only. We used the UTLAS 
Conversion Project listings as the 
sampling frame, which was initially 
sorted to exclude foreign language 
materials and serials^ as coded by 



the University of Toronto Library* 
We then drevy a systernatic randoni sam-' 
pie of 1600 items, as seen in Table 1* 
We believe this selection presents a 
good balance between the areas of 
knowledge concerning us, with an eye 
on potential user demand in a typical 
academic setting. In addition, we 
decided to study two selections in 
their entirety; i.e., exhaustive list-- 
ings of the University of Toronto, 
Eobarts Library holdings in Poat^ 
Confederation Ontario history (317 
Items) and Urban Planning (550 Iteras). 
These two selections will allow thor- 
ough saarchlng for real requests 
pDsed to the Robarts Library refer- 
ence staff* Taken together, we will 
work with augmented records of 2,460 
monographs- The sample and the two 
selections will be kept separate for 
purposes of analysis* 



TABLE 1 

Samgl^ coTOpo s 1 t i on , n ^l6O0* 



Humanities // of items Social Sciences ^/ of items 



1, Philosophy^ 151 5. Psychology! 349 
BC Logic BF 1-990 

BH Aesthetics 

BJ Ethics . 6, A.n thropo logy 1 145 

GN 1-696 

2. Hlstoryi 139 

DE Graeco--EoTiian World 7, Public Finance.* 137 

DF Ancient Greece lU 
DG Ancient Italy 

Rome to 476 8. Sociology^ ^ 153 

HM 1-221 

3* Arts: 197 

NB Seulpture Comp let e Listings 

NE Engraving 

NK Art Appliad to Industry 1. Urban Planning, 

Decoration & Ornament Urban Redevelopment* 550 

HT 166-177 

4, Literatures 329 

PN 1560-3300 Drama 2, Post Confederation 

Ontario History i 317 
. F 5520-5547 



* Total estimated population of English langtiag€p non-serial iteTOS N = 16,910, w^ith 
8»601 (or 51%) ill the Humanities, and 8,310 (or 49%) in Che Social Science 
dlsclp lines . 
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ANALYTICAL STUDY OF BOOK INDEXES 
. , , UNDERLAY 

From each mono^graph selected at the 
University of Toroiito , have xerDSted 
when available, IndeKes, table of con- 
tents and lioti of Illustrations, 
naps, figurta, tablasi etc. The pre- 
liminary analyils t^ill conslat of 
totals of tlie followiiigi 

1) actual nuraber of monographs 
ee lec ted 

2) QUinbet of laonographs with an 
Index 

3) nunber of moiiographs wlthoyt 
an tndeXj table of contents, 
or Hits 

4) i^ufflber of laoiiDgraphs that con- 
tain a table of contents but 
no IndeK 

5) nunbef □£ monographs that have 
no table of contents or index 
but have some kind of list of 
illus trat iona, maps, or tables. 

The above analyses will be broken down 
by L.C* class with totals for the en- 
t ire sample , 

In addition, an average nuab#r sof in- 
dex pages, table of contents pages, 
and lists of i llus tra tlona * raapSs* etc, 
vlll be computid for each class and 



the entiri sample. We will also cal- 
culate the average nambfir of entrlas 
per index page^ table of contents 
page and list pag^ for each L.C* clasa 
and the entire saspla. 

If possible^ urn will try Co aaaess 
dlffetences by date of pybl ea t ion . 
Alchough thm saraple was selected frDm 
post-1966 acqulsltlofis, it mmy contain 
older Items* 



ANYONE IWTERESfED? 



By this time 
newsletter , y 
ing what dlff 
make to you i 
preparation if 
question j vm 
holdings of o 
the data base 
the Univer iit 
We suipact th 
these titles 
you would be 
such a data b 
how much more 
to your colle 
yoy presently 



in your reading of our 
QU are probably wcnder- 
erence qur project will 
n your library , In 
or our answer to that 
hope to sample check the 
ther libraries against 

we are creating from 
y of Toronto collection, 
at you have many of 
in your library and that 
interested In searching 
ase on-line to determine 

access It will provide 
ctlofi than the access 

have . 



If interested In more detrils about 
thls> drop us a line. 



A MOTE OF APPRECIATION 

Many people need to be thanked for encouraging us to embark on this project, 
W€ are very appreciative of your enthusiastic support. Without It, we wotild 
not have embarked on such a tedious but important task. 

We hope that this and subsequent newsletters will keep you itiformed of our 
progress* 

We also hope you will continue to be interested in our work and send us com- 
ments ^hen you have Ideas to share with us. 
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